A Programming Language With a POMDP Inside

نویسندگان

  • Christopher H. Lin
  • Mausam
  • Daniel S. Weld
چکیده

We present POAPS, a novel planning system for defining PartiallyObservable Markov Decision Processes (POMDPs) that abstracts away from POMDP details for the benefit of non-expert practitioners. POAPS includes an expressive adaptive programming language based on Lisp that has constructs for choice points that can be dynamically optimized. Non-experts can use our language to write adaptive programs that have partially observable components without needing to specify belief/hidden states or reason about probabilities. POAPS is also a compiler that defines and performs the transformation of any program written in our language into a POMDP with control knowledge. We demonstrate the generality and power of POAPS in the rapidly growing domain of human computation by describing its expressiveness and simplicity by writing several POAPS programs for common crowdsourcing tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing a software for simulation of gaseous detectors with Monte carlo method in c++ programming language

In this paper we consider a gaseous detector and supposed, because of pass of an ionizing radiation, an electron created inside it. By numerical simulation with monte carlo method and concluding the impacts, scatterings and creation of secondary electrons, we find the trajectory of initial and secondary electrons. Dependence of number of secondary electrons to applied electrical field is invest...

متن کامل

From an Agent Logic to an Agent Programming Language for Partially Observable Stochastic Domains

Broadly speaking, my research concerns combining logic of action and POMDP theory in a coherent, theoretically sound language for agent programming. We have already developed a logic for specifying partially observable stochastic domains. A logic for reasoning with the models specified must still be developed. An agent programming language will then be developed and used to design controllers f...

متن کامل

Reinforcement Learning in Partially Observable Markov Decision Processes using Hybrid Probabilistic Logic Programs

We present a probabilistic logic programming framework to reinforcement learning, by integrating reinforcement learning, in POMDP environments, with normal hybrid probabilistic logic programs with probabilistic answer set semantics, that is capable of representing domain-specific knowledge. We formally prove the correctness of our approach. We show that the complexity of finding a policy for a ...

متن کامل

A BDI Agent Architecture for a POMDP Planner

Traditionally, agent architectures based on the BeliefDesire-Intention (BDI) model make use of precompiled plans, or if they do generate plans, the plans do not involve stochastic actions nor probabilistic observations. Plans that do involve these kinds of actions and observations are generated by partially observable Markov decision process (POMDP) planners. In particular for POMDP planning, w...

متن کامل

Optimizing Fixed-Size Stochastic Controllers for POMDPs

In this paper, we discuss a new approach that represents POMDP policies as finite-state controllers and formulates the optimal policy of a desired size as a nonlinear program (NLP). This new representation allows a wide range of powerful nonlinear programming algorithms to be used to solve POMDPs. Although solving the NLP optimally is often intractable, the results we obtain using an off-theshe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1608.08724  شماره 

صفحات  -

تاریخ انتشار 2016